Automatic Music Detection in Television Productions

نویسندگان

  • Klaus Seyerlehner
  • Tim Pohle
  • Markus Schedl
  • Gerhard Widmer
چکیده

This paper presents methods for the automatic detection of music within audio streams, in the foreor background. The problem occurs in the context of a real-world application, namely, the analysis of TV productions w.r.t. the use of music. In contrast to plain speech/music discrimination, the problem of detecting music in TV productions is extremely difficult, since music is often used to accentuate scenes while concurrently speech and any kind of noise signals might be present. We present results of extensive experiments with a set of standard machine learning algorithms and standard features, investigate the difference between frame-level and clip-level features, and demonstrate the importance of the application of smoothing functions as a post-processing step. Finally, we propose a new feature, called Continuous Frequency Activation (CFA), especially designed for music detection, and show experimentally that this feature is more precise than the other approaches in identifying segments with music in audio streams.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robin Weiß Virtual Video Quality Bachelor Thesis Computer Graphics Lab, Tu Braunschweig

Recent free-viewpoint rendering systems, like the Virtual Video Camera (VVC) system developed at the Computer Graphics Laboratoy at the TU Braunschweig, allow for free viewpoint navigation through time and space and produce visually convincing results. These can be used for television or movie productions, as well as e.g. for music video productions. The results are prone to artifacts, however,...

متن کامل

Automatic Sample Detection in Polyphonic Music

The term ‘sampling’ refers to the usage of snippets or loops from existing songs or sample libraries in new songs, mashups, or other music productions. The ability to automatically detect sampling in music is, for instance, beneficial for studies tracking artist influences geographically and temporally. We present a method based on Non-negative Matrix Factorization (NMF) and Dynamic Time Warpin...

متن کامل

Automatic Identification and Classification of the Iranian Traditional Music Scales (Dastgāh) and Melody Models (Gusheh): Analytical and Comparative Review on Conducted Research

Background and Aim: Automatic identification and classification of the Iranian traditional music scales (Dastgāh) and melody models (Gusheh) has attracted the attention of the researchers for more than a decade. The current research aims to review conducted researches on this area and consider its different approached and obstacles. Method: The research approach is content analysis and data col...

متن کامل

Evaluation of real-time audio-visual speech recognition

In this paper, we propose and develop a real-time audio-visual automatic continuous speech recognition system. The system utilizes live speech signals and facial images that collected from a microphone and a camera. Optical-flow-based features are used as visual feature. VAD technology and lip tracking are utilized to improve recognition accuracy. In this paper, several experiments are conducte...

متن کامل

Live TV-Set with mobile Augmented Reality

To interact as audience to a live television production is today a common procedure. New is to use Augmented Reality support to reach the audience at home. Also new is the situation to have many Television Sets at private houses connected with the internet. Both are offering lots of possibilities for interactive live television productions. The author shows current productions, a by himself pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007